METHOD FOR GENERATING AN IMPROVED IMAGE SIGNAL WHEN 
ESTIMATING THE MOTION OF IMAGE SEQUENCES, IN PARTICULAR A 
PREDICTION SIGNAL FOR VIDEO IMAGES USING MOTION-COMPENSATING 
PREDICTION 



Background Information 



The present invention relates to a method for generating an 
improved image signal when estimating the motion of image 
sequences, in particular a prediction signal for video 
images using motion-compensating prediction, with motion 
vectors, which, for each picture block of a current image, 
indicate the position of the picture block used for the 
prediction with respect to a chronologically preceding 
reference image, being formed for picture blocks. 

European Patent EP 0 558 922 B1 describes a method for 
improving motion estimation in image sequences, with half-pel 
accuracy, according to the full-search method. There, the 
search area in a first process step, and the match block in a 
second process step, are filtered with the aid of an 
additional digital filter which enables a raster shift of 
the pixel raster by ½ pel. Using this measure, a distortion 
of the motion vector field can be ruled out. 

In "MPEG-4 Video Verification Model Version 7.0", 
Bristol, April 1997, MPEG 97/N1642 in ISO/IEC JTC1/SC 
29/WG11, an encoder and decoder for object-based coding of 
video image sequences are specified. In this context, one no 
longer encodes and transmits rectangular pictures of a fixed 
size to the receiver, but instead so-called "VIDEO OBJECTS" 
(VO) of any shape and size. The image formation of such a VO 
in the camera image plane at a specific instant is referred 
to as a VIDEO OBJECT PLANE (VOP). Consequently, the 
relation between VO and VOP is equivalent to the relation 
between image sequence and image in the case of the 
transmission of rectangular pictures of fixed size. 

The motion-compensating prediction in the verification model 
is carried out with the assistance of so-called "blockwise 
motion vectors" which, for each block of size 8 x 8 or 
16 x 16 pixels of the current image, specify the position 
of the block used for the prediction in an already-transmitted 
reference image. In this context, the resolution 
of the motion vectors is limited to half a pixel, pixels 
between the scanning raster (half-pixel position) being 
generated by bilinear interpolation filtering from the 
pixels on the scanning raster (integer-pixel position) 
(Figure 1). In this case, + indicates the integer-pixel 
position, and O the half-pixel position. Interpolated values 
a, b, c, and d in half-pixel position are derived from the 
following relations: 

a = A, b = (A + B)//2, c = (A + C)//2, 

d = (A + B + C + D)//4, // indicating a rounded integer 
division. 
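The relations above can be illustrated with a minimal sketch. It assumes that B lies to the right of A, C below A, and D diagonally below right (Figure 1 is not reproduced here), that pixel values are non-negative integers, and that the rounded division rounds halves upward. The names `rdiv` and `bilinear_half_pel` are hypothetical.

```python
def rdiv(num, den):
    """Rounded integer division for non-negative operands.

    Assumption: exact halves are rounded upward.
    """
    return (num + den // 2) // den


def bilinear_half_pel(A, B, C, D):
    """Return (a, b, c, d) for the assumed 2x2 integer-pixel layout

        A B
        C D

    following a = A, b = (A+B)//2, c = (A+C)//2, d = (A+B+C+D)//4.
    """
    a = A
    b = rdiv(A + B, 2)
    c = rdiv(A + C, 2)
    d = rdiv(A + B + C + D, 4)
    return a, b, c, d
```

Only two or four direct neighbors contribute to each interpolated value, which is the narrow neighborhood the invention sets out to widen.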

Summary of the Invention 

By applying the measures of the present invention, one can 
improve the quality of the prediction signal and, thus, the 
coding efficiency. In so doing, a greater local neighborhood 
is considered than in the case of bilinear interpolation to 
generate pixels between the pixel scanning raster. The 
aliasing-reducing interpolation filtering according to the 
present invention leads to an increased resolution of the 
motion vector and, consequently, to a prediction gain and an 
increased coding efficiency. In the present invention, the 
FIR filter coefficients can be adapted to the signals to be 
coded, and be transmitted separately for each video object, 
thereby further increasing coding efficiency and enhancing 
the flexibility of the method. 

In contrast to the design approach according to European 
Patent EP 0 558 922 B1, there is no need to design any 
additional polyphase filter structures for intermediate 
positions having ¼ pel pixel resolution in the horizontal 
and vertical directions. 



By applying the measures of the present invention, the image 
sequence frequency of an MPEG-1 coder can be doubled from 25 
Hz to 50 Hz, with the data rate remaining constant. In the 
case of an MPEG-2 coder, the data rate can be reduced by up 
to 30%, with the image quality remaining constant. 

Brief Description of the Drawings 

Exemplary embodiments of the present invention are explained 
in the following in greater detail, on the basis of the 
drawing, whose figures show: 

Figure 2 an interpolation diagram for pixels between the 
pixel scanning raster; 
Figure 3 the structure of an FIR filter for interpolation; 
Figure 4 a further interpolation having an even higher 
resolution. 
Description of Exemplary Embodiments 



In the case of the method according to the present 
invention, motion vectors are formed for picture blocks, the 
motion vectors, for each picture block of a current image, 
indicating the position of the picture block used for the 
prediction with respect to a chronologically preceding 
reference image. 
The motion vectors for the prediction are determined in 
three successive steps: 

In a first search step, a motion vector is determined for 
each picture block with pel accuracy in accordance with a 
conventional method, for example, in accordance with the 
full-search block matching method. In this context, the 
minimum error criterion is determined for possible motion 
positions, and the vector which best describes the motion of 
the picture block is selected (European Patent EP 0 368 151 
B1). 
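The first search step can be sketched as an exhaustive block-matching search at integer-pel accuracy. This is only an illustration under stated assumptions: the error criterion is taken to be the sum of absolute differences (the text does not name one), images are lists of rows, and the names `sad` and `full_search` are hypothetical.

```python
def sad(cur, ref, bx, by, dx, dy, n):
    """Sum of absolute differences between the n x n block of `cur` at
    (bx, by) and the block of `ref` displaced by (dx, dy).
    Assumption: SAD stands in for the unspecified error criterion."""
    total = 0
    for y in range(n):
        for x in range(n):
            total += abs(cur[by + y][bx + x] - ref[by + dy + y][bx + dx + x])
    return total


def full_search(cur, ref, bx, by, n, search_range):
    """Test every integer displacement within +/- search_range and return
    (dx, dy, cost) of the candidate with the smallest error.
    Candidates reaching outside the reference image are skipped."""
    height, width = len(ref), len(ref[0])
    best = None
    for dy in range(-search_range, search_range + 1):
        for dx in range(-search_range, search_range + 1):
            x0, y0 = bx + dx, by + dy
            if x0 < 0 or y0 < 0 or x0 + n > width or y0 + n > height:
                continue
            cost = sad(cur, ref, bx, by, dx, dy, n)
            if best is None or cost < best[0]:
                best = (cost, dx, dy)
    return best[1], best[2], best[0]
```

The returned displacement gives the position of the matching block in the reference image relative to the current block, and serves as the starting point for the sub-pel refinement.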

In a second search step, which, again, is based on such a 
search for the minimum error criterion, an improved motion 
vector is ascertained with sub-pel accuracy, starting out 
from the motion vector ascertained in the first step, using 
an aliasing-reducing interpolation filtering, with the aid 
of a digital, symmetric FIR (finite impulse response) 
filter. In the process, a higher resolution is selected than 
in the first search step. Preferably, one selects a 
resolution of a half pixel relative to the pixel raster. 

Figure 2 illustrates the interpolation pattern for pixels b, 
c, and d between the pixel raster, as derived from the 
neighboring pixels A, B, C, D, E, F, G, H on the pixel 
raster. + indicates the integer-pixel position, O the 
half-pixel position, so that: 

b = (C01 x (A_{-1} + A_{+1}) + C02 x (A_{-2} + A_{+2}) + 
     C03 x (A_{-3} + A_{+3}) + C04 x (A_{-4} + A_{+4})) / 256 

c_i = (C01 x (A_i + E_i) + C02 x (B_i + F_i) + 
       C03 x (C_i + G_i) + C04 x (D_i + H_i)) / 256 

d = (C01 x (c_{-1} + c_{+1}) + C02 x (c_{-2} + c_{+2}) + 
     C03 x (c_{-3} + c_{+3}) + C04 x (c_{-4} + c_{+4})) / 256 
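The three relations share one symmetric 8-tap filter, which a short sketch can make concrete. The following is an illustration under several assumptions, not the patented implementation: the subscripts +/-k are read as the k-th nearest integer pixels on either side of the half-pel position, the vertical pairs (A, E), (B, F), (C, G), (D, H) are read the same way along a column, border samples are repeated, the taps are the integer numerators 161, -43, 23, -8 given later in the description with a single division by 256, and no rounding or clipping is applied. The names `TAPS`, `half_pel_1d`, and `interpolate_half_pel` are hypothetical.

```python
# Numerators of C01..C04 as stated in the description (assumed integer form).
TAPS = (161, -43, 23, -8)


def half_pel_1d(samples, i, taps=TAPS):
    """Half-pel value between samples[i] and samples[i + 1].

    Tap k weights the pair of integer samples at distance k + 1 on
    either side. Assumption: indices beyond the border are clamped.
    """
    n = len(samples)
    acc = 0
    for k, c in enumerate(taps):
        left = samples[max(i - k, 0)]
        right = samples[min(i + 1 + k, n - 1)]
        acc += c * (left + right)
    return acc / 256


def interpolate_half_pel(img, x, y):
    """Return (b, c, d): the half-pel values to the right of, below, and
    diagonally below right of integer pixel (x, y)."""
    height, width = len(img), len(img[0])

    def c_at(col):
        # Vertical half-pel value in a (clamped) column.
        col = min(max(col, 0), width - 1)
        column = [img[r][col] for r in range(height)]
        return half_pel_1d(column, y)

    b = half_pel_1d(img[y], x)
    c = c_at(x)
    # d is filtered horizontally from the vertical half-pel values c.
    acc = 0
    for k, t in enumerate(TAPS):
        acc += t * (c_at(x - k) + c_at(x + 1 + k))
    d = acc / 256
    return b, c, d
```

Note that the numerators as printed sum to 2 x 133 = 266 rather than 256, so a flat area comes out scaled by about 1.04 in this sketch; the values are reproduced as stated and are not renormalized.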

The structure of the FIR interpolation filter used is 
apparent in Figure 3. Following each pixel position δp, it 
branches to a coefficient evaluator 1, 2, 3, etc., and has a 
summing device 10 at the output. As is apparent from the 
above relationships, a greater local neighborhood is 
considered when generating pixels between the scanning 
raster than in the case of the bilinear interpolation 
according to the related art. In this context, the 
interpolation filter coefficients C01, C02, C03, C04 are 
determined in a way that minimizes the interpolation error 
performance. The coefficients can be determined directly 
using the known estimation method of the least 
root-mean-square error. From the minimization of the interpolation 
error performance, one obtains a linear system of equations, 
whose coefficients can be derived from the principle of 
orthogonality. A set of FIR filter coefficients optimized in 
this manner is given by the coefficients C01 = 161/256, C02 
= -43/256, C03 = 23/256, C04 = -8/256. 
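How such a linear system arises can be sketched for a one-dimensional training signal. The sketch assumes that a signal sampled at twice the target resolution is available, so that its even samples act as the pixel raster and its odd samples as the true half-pel values. It then accumulates the normal equations that follow from requiring the error to be orthogonal to each symmetric sample pair, and solves them by Gaussian elimination. The training setup and the names `solve`, `fit_symmetric_taps`, and `interpolation_mse` are hypothetical; the text does not state how the published coefficients were obtained in detail.

```python
def solve(M, v):
    """Solve M x = v by Gaussian elimination with partial pivoting."""
    n = len(v)
    a = [row[:] + [v[i]] for i, row in enumerate(M)]
    for col in range(n):
        piv = max(range(col, n), key=lambda r: abs(a[r][col]))
        a[col], a[piv] = a[piv], a[col]
        for r in range(col + 1, n):
            f = a[r][col] / a[col][col]
            for c in range(col, n + 1):
                a[r][c] -= f * a[col][c]
    x = [0.0] * n
    for r in range(n - 1, -1, -1):
        s = a[r][n] - sum(a[r][c] * x[c] for c in range(r + 1, n))
        x[r] = s / a[r][r]
    return x


def fit_symmetric_taps(hi, num_taps=4):
    """Least-squares taps c_k such that sum_k c_k * (lo[i-k] + lo[i+1+k])
    approximates the true half-pel sample hi[2*i + 1], with lo = hi[0::2]."""
    lo = hi[0::2]
    R = [[0.0] * num_taps for _ in range(num_taps)]
    p = [0.0] * num_taps
    for i in range(num_taps - 1, len(lo) - num_taps):
        u = [lo[i - k] + lo[i + 1 + k] for k in range(num_taps)]
        t = hi[2 * i + 1]
        for j in range(num_taps):
            p[j] += t * u[j]          # cross-correlation with the target
            for k in range(num_taps):
                R[j][k] += u[j] * u[k]  # correlation of the sample pairs
    return solve(R, p)


def interpolation_mse(hi, taps):
    """Mean squared interpolation error of `taps` on the training signal."""
    lo = hi[0::2]
    n = len(taps)
    err, count = 0.0, 0
    for i in range(n - 1, len(lo) - n):
        est = sum(c * (lo[i - k] + lo[i + 1 + k]) for k, c in enumerate(taps))
        err += (hi[2 * i + 1] - est) ** 2
        count += 1
    return err / count
```

Because bilinear interpolation corresponds to the tap set (1/2, 0, 0, 0), which lies inside the fitted family, the fitted taps can never do worse than bilinear interpolation on the training signal itself.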



In the third search step, starting from the motion vector 
determined with an accuracy of ½ pel, a local search is 
performed using a further interpolation filtering, taking 
the eight neighboring pixels as a basis, with a resolution 
that is increased still further, preferably to ¼ pixel. As 
before, one selects the motion vector having the lowest 
prediction error performance. 
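The local search of the third step can be sketched as follows. The sketch assumes that motion vectors are held as integer pairs in quarter-pel units and that a caller-supplied function `cost` returns the prediction error for a candidate vector; both conventions, and the names `to_quarter_units` and `refine_motion_vector`, are hypothetical.

```python
def to_quarter_units(mv_half):
    """Convert a motion vector given in half-pel units to quarter-pel units."""
    return (2 * mv_half[0], 2 * mv_half[1])


def refine_motion_vector(cost, mv):
    """Evaluate `mv` and its eight quarter-pel neighbors and return the
    candidate with the lowest cost as (best_mv, best_cost).

    `cost` is an assumed callable mapping a vector to its prediction error.
    """
    best_mv, best_cost = mv, cost(mv)
    for dy in (-1, 0, 1):
        for dx in (-1, 0, 1):
            if dx == 0 and dy == 0:
                continue
            cand = (mv[0] + dx, mv[1] + dy)
            c = cost(cand)
            if c < best_cost:
                best_mv, best_cost = cand, c
    return best_mv, best_cost
```

Only nine candidates are evaluated, so the added cost of the finer resolution stays small compared with the full search of the first step.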



Figure 4 shows the interpolation pattern for this. The 
integer pixel positions are marked by X, the half-pixel 
positions by O, and the quarter-pixel positions by -. O 
indicates the best compensation with ½ pixel, and + the 
quarter-pixel search position. 

The interpolation is carried out relative to the pixel 
raster, with a half-pixel resolution from the second search 
step, using filter coefficients C01' = 1/2, C02' = 0, C03' = 
0, C04' = 0. 
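With only the first coefficient non-zero and equal to 1/2, the filter reduces to averaging the two nearest values on the half-pel grid. A minimal sketch, assuming the half-pel grid from the second step is available as a list of rows, that coordinates are given in quarter-pel units, that the averaging is applied along both axes for diagonal positions, and that border values are repeated; `quarter_pel_sample` is a hypothetical name.

```python
def quarter_pel_sample(half_grid, qx, qy):
    """Value at quarter-pel coordinates (qx, qy).

    half_grid[j][i] is assumed to hold the value at quarter-pel
    coordinates (2*i, 2*j). Even coordinates fall on the half-pel grid;
    odd coordinates are averaged from the two nearest grid values along
    that axis (coefficient 1/2 on each neighbor).
    """
    height, width = len(half_grid), len(half_grid[0])

    def at(i, j):
        # Assumption: indices outside the grid are clamped to the border.
        return half_grid[min(max(j, 0), height - 1)][min(max(i, 0), width - 1)]

    x0, y0 = qx // 2, qy // 2
    x1 = x0 + (qx % 2)  # equals x0 when qx lies on the half-pel grid
    y1 = y0 + (qy % 2)
    return (at(x0, y0) + at(x1, y0) + at(x0, y1) + at(x1, y1)) / 4
```

When a coordinate is even, the two values taken along that axis coincide, so the four-term mean collapses to a two-term average or to the grid value itself.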

The same previously introduced interpolation technique is 
used for the motion-compensating prediction. 

If the processing is carried out within a coder having a 
reduced image format (SIF format within an MPEG-1 coder or 
QCIF in an H.263 coder), but the original input format is 
used for the display, for example, CCIR 601 [1] in the case 
of MPEG-1 or CIF in the case of H.263, a local interpolation 
filtering must be carried out as a post-processing step. The 
described aliasing-compensating interpolation filtering can 
be used for this purpose as well. 

To activate the aliasing-compensating interpolation using 
¼ pixel resolution, activation bits can be inserted into an 
image-transmission bit stream. 

To predict video objects, filter coefficients C01 through 
C04, and C01' through C04', can be separately conditioned for 
each of the video objects VO, and inserted into the 
image-transmission bit stream at the beginning of transmission of 
the video object in question. 

For the encoding of a motion vector, the range of values of 
the motion vector differences to be coded can be adapted to 
the increased resolution. 